Quantitative Comparisons between Time Domain Speech Fundamental Frequency Estimation Algorithms

Authors

Ian S. Howard

David M. Howard

Abstract

Two techniques are presented here to enable quantitative comparison of time domain fundamental frequency estimation algorithms against a reference that makes use of the output from a laryngograph. These measures are carried out on the pulsatile outputs produced by the devices, where each pulse corresponds to an epoch of acoustic excitation due to a vocal fold closure. The results given here are for a peak-picking algorithm. The comparison techniques are: 1) Receiver operating characteristic. This is a plot of the probability of successful detection of a vocal fold closure, as compared to the reference, against the number of false alarms. It is shown that this measure gives a clear indication as to how well the device under test performs with respect to the reference, as well as providing a quantitative method for device parameter optimisation. 2) Jitter distribution. This is a histogram of the differences in the times of occurrence of output pulses from the reference and the corresponding time-aligned pulses from the device under test. This measure gives an indication of how precisely and consistently devices are able to locate vocal fold closure instants.
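The two measures described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the greedy nearest-pulse alignment, the `tolerance` window, and the `bin_width` parameter are all assumptions made for illustration, since the abstract does not say how pulses are time-aligned. Pulse times may be in any consistent unit (seconds or sample indices) and are assumed to be sorted.

```python
from bisect import bisect_left
from collections import Counter


def align_pulses(ref_times, test_times, tolerance):
    """Pair each reference pulse with the nearest unused test pulse.

    Assumption: a pair counts as a detection only if the two pulses lie
    within `tolerance` of each other. Both lists must be sorted ascending.
    Returns (pairs, n_false_alarms); pairs holds (ref_time, test_time).
    """
    used = set()
    pairs = []
    for r in ref_times:
        i = bisect_left(test_times, r)
        best = None
        # Only the test pulses just before and just after r can be nearest.
        for j in (i - 1, i):
            if 0 <= j < len(test_times) and j not in used:
                d = abs(test_times[j] - r)
                if d <= tolerance and (best is None or d < abs(test_times[best] - r)):
                    best = j
        if best is not None:
            used.add(best)
            pairs.append((r, test_times[best]))
    # Test pulses with no reference partner are counted as false alarms.
    false_alarms = len(test_times) - len(used)
    return pairs, false_alarms


def roc_point(ref_times, test_times, tolerance):
    """One ROC point: (probability of detection, number of false alarms)."""
    pairs, false_alarms = align_pulses(ref_times, test_times, tolerance)
    p_detect = len(pairs) / len(ref_times) if ref_times else 0.0
    return p_detect, false_alarms


def jitter_histogram(ref_times, test_times, tolerance, bin_width):
    """Histogram of (test - reference) time differences for aligned pulses.

    Keys are bin indices (difference / bin_width, rounded); values are counts.
    """
    pairs, _ = align_pulses(ref_times, test_times, tolerance)
    counts = Counter(round((t - r) / bin_width) for r, t in pairs)
    return dict(sorted(counts.items()))


# Hypothetical pulse positions in samples: laryngograph reference vs. device.
ref = [100, 200, 300, 400]
test = [102, 199, 250, 405]
print(roc_point(ref, test, tolerance=10))
print(jitter_histogram(ref, test, tolerance=10, bin_width=1))
```

Repeating `roc_point` while sweeping a device parameter (for example, a detection threshold in a peak picker) would give one point per setting, which is one way to trace the curve the abstract uses for parameter optimisation.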

Similar resources

Real-time fundamental frequency estimation by least-square fitting

The real-time performance of a fundamental frequency estimation algorithm depends not only on its computational efficiency but also on its ability to obtain accurate estimates from short signal segments. Previous frequency-domain algorithms make use of spectral analysis algorithms that require the application of a window function, which causes them to fail when signal segments are short and their ...

Robust algorithms for speech reconstruction on mobile devices

This thesis is concerned with reconstructing an intelligible time-domain speech signal from speech recognition features, such as Mel-frequency cepstral coefficients (MFCCs), in a distributed speech recognition (DSR) environment. The initial reconstruction methods in this thesis require, in addition to MFCC vectors, fundamental frequency and voicing information. In the later parts of the thesis t...

A Novel Frequency Domain Linearly Constrained Minimum Variance Filter for Speech Enhancement

A reliable speech enhancement method is important for speech applications as a pre-processing step to improve their overall performance. In this paper, we propose a novel frequency domain method for single channel speech enhancement. Conventional frequency domain methods usually neglect the correlation between neighboring time-frequency components of the signals. In the proposed method, we take...

Pitch estimation in noisy speech based on temporal accumulation of spectrum peaks

In this paper, we present a study on robust pitch estimation by integrating spectral and temporal information in speech. Spectrum harmonics are important representations of the speech fundamental frequency. Harmonic-related spectral peaks of speech evolve much more slowly than the spectral peaks of noise. This motivates the proposition of temporally accumulated peak spectrum (TAPS), which is co...

A Spectro-Temporal Demodulation Technique for Pitch Estimation

We consider a two-dimensional demodulation framework for spectro-temporal analysis of the speech signal. We construct narrowband (NB) speech spectrograms, and demodulate them using the Riesz transform, which is a two-dimensional extension of the Hilbert transform. The demodulation results in time-frequency envelope (amplitude modulation or AM) and time-frequency carrier (frequency modulation or F...

Smooth Cepstrum Calculation Using Modified Bartlett Hanning Window

The cepstrum is an algorithm for analyzing speech signals in the frequency domain. It is a conventional method of fundamental peak picking, i.e. of finding the fundamental frequency or pitch. For a speech signal it is necessary to identify the fundamental frequency correctly in order to have a robust system for speaker identification and verification. Using this approach two algorithms have been proposed using Hammin...

Publication date: 2005
